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(54) DNA STRAND USEFUL IN INCREASING CAROTENOID YIELD 

(57) A DNA chain having characteristic of increas- 
ing carotenoid production, and containing the nucle- 
otide sequence which encodes the polypeptide having 
the substantially same amino acid sequence described 
in SEQUENCE ID No. 1 or 2, or a DNA chain which 
hybridizes with said DNA chain, and a method for pro- 
duction for carotenoid characterized by introducing said 
DNA chain into the carotenoid-producing microorgan- 
isms, culturing said transformed microorganism and 
increasing carotenoid content in the culture broth and 
cells. 
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Description 

FIELD OF THE INVENTION 

5 The present invention relates to a DNA chain which provides higher carotenoid content during biosynthesis of car- 

otenoid and a method for producing carotenoids characterized by introducing said DNA chain into carotenoid producing 
microorganism to express said chain and to obtain higher carotenoid content. 

BACK GROUND OF THE INVENTION 

io 

Carotenoid is a general name of a kind of natural pigments. Generally, carotenoids have 40 carbon atoms and con- 
sists of isoprene skeletons, and Carotenoids are abundant in the natural world Approximately 600 kinds of carotenoids 
have been isolated and identified up to the present [(see Key to carotenoids. Basel-Boston, Birkhauser, l987(Pfander, 
H ed.)]. Carotenoids are synthesized through the isoprenoid biosynthetic pathway, a part of which is common to the 

i5 pathways for steroids and other terpenoids. Passing through the isoprene common biosynthetic pathway, hydroxymeth- 
ylglutaryl-CoA(HMG-CoA) is converted to isopentenyl pyrophosphate(IPP), which has 5 carbon atoms, via mevalonate. 
Then IPP is converted to dimethylallyl pyrophosphate(DMAPP) by isomerization. Then, by polycondensation with IPP 
which has 5 carbon atoms, DMAPP is converted sequentially to geranyl pyrophosphate{GPP which has 10 carbon 
atoms), famesyl pyrophosphate (FPP which has 15 carbon atoms), geranylgeranyl pyrophosphate(GGPP which has 20 

20 carbon atoms) and so forth (Figure 1). 

The carotenoid biosynthetic pathway is branched from the isoprene common pathway at the point of GGPP is 
formed. At the point, two molecules of GGPP are condensed to synthesize phytoene which is the first carotenoid and 
colorless. Then, phytoene is converted to lycopene by desaturation reaction. Then, lycopene is converted to p-carotene 
by cyclization Various xanthophylls such as zeaxanthin and astaxanthin are synthesized by introducing hydroxyl 

25 groups or keto groups to p-carotene. 

Recently, the inventors of the present invention cloned the carotenoid biosynthesis genes derived from Erwinia ure- 
dovora . which is a non-photosynthetic epiphytic bacterium in Escherichia coli by using yellowish color of Er uredovora 
as markers and elucidated the functions of the genes. Then, various combinations of these genes are introduced to 
express, and it made possible that microorganisms such as E CQh and yeast produce phytoene, lycopene, p-carotene, 

30 zeaxanthin and so forth(See Figure 2): [See Misawa, N., Nakagawa, M., Kobayashi, K., Yamano. S., Izawa. Y., Naka- 
mura, K. and Harashima, K., "Elucidation of the Erwinia uredovora carotenoid biosynthetic pathway by functional anal- 
ysis of gene products expressed in Escherichia coli ". J. Bacteriol.. 172: 6704-6712 (1990); Misawa, N., Yamano, S., and 
Ikenaga. H., "Production of p-carotene in Zvmomonas mobilis and Agrobacterium tumefaciens by introduction of the 
biosynthesis genes from Erwinia uredovora ". Appl. Environ. Microbiol.. 57: 1847-1849 (1991); Yamano, S.. Ishii, T, 

35 Nakagawa, M., Ikenaga, H., and Misawa, N. ( "Metabolic engineering for production of p-carotene and lycopene in Sac- 
charomvces cerevisiae ". Biosci. Biotech. Biochem., 58: 1112-1114 (1994) and Japanese Patent Application laid-open 
No. HE! 3-58786(Japanese Patent Application filing No. HEI 2-53255):"A DNA chain useful for synthesis of carotenoids" 
by the inventors of the present invention]. With the carotenoid biosynthesis genes from Er. uredovora . carotenoids can 
be synthesized from FPP. Since FPP is the common substrate not only for carotenoids but also for steroids and other 

40 terpenoids, bacteria incapable of synthesizing carotenoids also have FPP. Accordingly, for example, when four £rj 
genes, crtE, £fjB, crtl and £rtY, which are necessary for biosynthesis of p-carotene from FPP are introduced in micro- 
organisms, the microorganism becomes capable of producing p-carotene (See Figure 2). Furthermore, by the same 
procedures as mentioned above, the inventors cloned the carotenoid biosynthesis genes derived from a marine bacte- 
rium, Agrobacterium aurantiacum in E coH. By expressing various combinations of the genes from the bacterium and 

45 those from the above-mentioned Er. uredovora . it made possible that the microorganisms such as E £oli produce 
astaxanthin, canthaxanthin and so forth (See Figure 3): (Norihiko Misawa et al., "Elucidation of an astaxanthin biosyn- 
thetic pathway at the level of the biosynthesis genes", Abstract of the 36th Symposium on the chemistry of natural prod- 
ucts: 175-180 (1994)). Among the above carotenoids, astaxanthin, zeaxanthin and p-carotene are already in practical 
use and are regarded as promising substances. They are used for food or feed additives as red or yellow natural color- 
so ing agents or as nutritional aid having cancer prophylactic activity, immunopotentiating activity or provitamin A activity. 
Accordingly when the carotenoid biosynthesis genes obtained by the inventors is used as exogenous genes for trans- 
forming microorganisms such as coli to express, it gave microorganisms such as £L GQli the capability of biosynthesis 
for producing useful carotenoids. Up to now, it is the only way to improve production of useful carotenoids was to find 
out microorganism which can synthesize sufficient amount of a targeted carotenoid, and to try to increase its production 

55 by investigating culture conditions or mutation treatment. Owing to the studies done by the inventors, it became possible 
to choose host microorganism which is cultured easily and proliferates rapidly, and is guaranteed to be safe for food 
regardless of its carotenoid producing capability. As a matter of course, it is also possible to use microorganisms which 
can synthesize sufficient amount of useful carotenoids originally. In such a case, by transforming the microorganisms 
with carotenoid biosynthesis genes, it became possible to obtain higher carotenoid production or to alter final caroten- 
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oid products. For example, when both £rjW and crlZ genes from Ag^ aurantiacum were introduced into a microorganism 
capable of producing p-carotene as a final product to express them, the microorganism was transformed to another one 
which produce astaxanthin as a final product. 

On the other hand, both astaxanthin and p-carotene can also be synthesized by organic synthesis methods. In 

5 these cases, considering these carotenoids are used for feed or food additives, there is problems that by-products are 
also produced and such synthetic products are not preferred by consumers because they prefer natural products. How- 
ever, carotenoids produced by the conventional fermentation methods could not compete with those by the organic syn- 
thesis methods in price. As mentioned earlier, when the above mentioned carotenoid biosynthesis genes are used, it 
improves the fermentation methods, thereby it is considered that the carotenoid produced by the fermentation methods 

w will be able to compete with those by the organic synthesis methods in price. If the microorganism can accumulate 
enough amount of carotenoid in itself, the carotenoid produced by the microorganisms will succeed in such price com- 
petition. Therefore, a technology to obtain higher carotenoid content by using microorganisms has been longed for. 

Until now. in order to obtain higher carotenoid production in its biosynthesis; the traditional random mutation 
method is only employed to select mutant strains having higher carotenoid content with mutagenic agent such as NTG. 

?5 However, this method requires huge amount of time and labor of technicians. In addition, even if enhancement of car- 
otenoid synthesis is successfully achieved, the method requires both huge amount of time and effort to inhibit decreas- 
ing of carotenoid content caused by frequent reverse mutations naturally happens because the method lacks its 
theoretical basis. 

20 SUMMARY OF THE INVENTION 

The object of the present invention is to increase amount of carotenoids biosynthetically produced by microorgan- 
isms. 

To solve the above problem, the inventors have investigated the problem thoroughly and developed a novel tech- 
25 nology which provides several times higher carotenoid production amount by introducing a DNA chain containing only 
one gene into a carotenoid producing microorganism to express the gene in them. 

More specifically, the inventors of the present invention found the foMowings and completed the present invention. 
When a DNA chain containing a gene substantially encoding an amino acid sequence of IPP isomerase which converts 
IPP into DMAPP, is introduced in microorganisms such as E. coji having carotenoid synthesis gene derived from Er 
30 uredovora and so forth, content of carotenoid in cells such as lycopene and p-carotene becomes 1.5-4.5 times higher 
than that in control cells can be achieved. The gene substantially encoding IPP isomerase amino acid sequence which 
converts IPP into DMAPP was obtained from the astaxanthin producing microorganisms such as Phaffia rhodozvma 
and Haematococcus pluvialis . 

The characteristics of the DNA chain of the present invention are as follows. 

35 

(1) A DNA chain capable of increasing carotenoid production amount and containing the nucleotide sequence 
which encodes the polypeptide having the amino acid sequence substantially shown in Sequence ID No. 1, or a 
DNA chain that can be hybridized with said DNA chain. 

(2) A DNA chain capable of increasing carotenoid production and containing the nucleotide sequence which 
40 encodes the polypeptide having the amino acid sequence substantially shown in Sequence ID No. 2, or a DNA 

chain that can be hybridized with said DNA chain. 

The present invention also relates to a method for carotenoid production. The characteristics of the carotenoids 
production methods of the present invention are as follows. 

(3) A production method characterized by introducing the DNA chain mentioned above either (1 ) or (2) into carote- 
45 noid producing microorganism, culturing said transformed microorganism and increasing carotenoid content in the 

cells and culture broth. 

(4) A production method characterized by introducing the DNA chain containing the nucleotide sequence which 
encodes the polypeptide having the substantially same amino acid sequence shown in Sequence ID No. 3, or a 
DNA chain that can be hybridized with said DNA chain into carotenoid producing microorganism, culturing said 

so microorganism and increasing carotenoid content in the cells and culture broth. 

The present invention is described herein below. 

As described in before, by introducing the carotenoid biosynthesis gene derived from microorganisms such as 
Erwinia uredovora . the non-photosynthetic soil bacteria and Aqrobacterium aurantiacum , the marine bacteria) into 
55 other microorganisms which do not produce carotenoids such as E coh, the microorganism can produce useful caro- 
tenoids such as astaxanthin, zeaxanthin, p-carotene and lycopene. In order to compete in price of the carotenoid pro- 
duced by using the organic synthesis methods, it is necessary to achieve as higher carotenoid production as possible. 
The IPP isomerase gene, which include the gene encoding the polypeptide whose amino acid sequence is substantially 
IPP isomerase, of the invention is extremely useful for increasing the production amount of carotenoids. By using mod- 



3 

3NSDOCID' <EP 0769SS1 A1 _l_> 



EP 0 769 551 A1 



ern biotechnology, it is relatively easy to increase production amount of a protein encoded by an exogenous gene by 
enhancing expression level of the gene. However, rf amounts of substrate necessary for a protein, that is enzyme, is 
limited, higher production of the protein does not lead to higher production of biochemicals such as carotenoids. For 
example, without sufficient amount of FPP, which is the first substrate, enhancement of expression level of the carote- 

5 noid synthesis genes does not lead to higher amount of carotenoids production. This time, we succeeded in increasing 
carotenoid production amount by introducing the IPP isomerase gene. It is considered that the introduction of the IPP 
isomerase gene makes the flow of, the upstream of the pathway up to FPP larger(more efficient) and consequently, 
increased supply of FPP led to higher carotenoid production amount. The present invention started from the findings 
that by introducing either the gene encoding IPP isomerase. which convert from IPP to DMAPP vise versa, or encoding 

io the protein homologous to IPP isomerase into carotenoid producing microorganism such as EL cjpji, to express the 
gene, carotenoid production amount is increased. By using carotenoid biosynthesis genes from Er uredovora . cDNA 
expression libraries of Phaffia rhodozvma . Haematococcus pluvialis and so forth were prepared in p-carotene produc- 
ing E Qpji as a host. As increased p-carotene content in E. coli made, some of the yellowish colonies brighter till almost 
orange. The plasmids extracted from such £oJi colonies were analyzed and were found to have genes with high 

75 homology to IPP isomerase of Saccharomyces cerevisiae . It has been speculated that HMG-CoA reductase(Figure 1), 
which catalyzes the reaction from HMG-CoA to mevalonate. may be the rate limiting enzyme for terpenoids including 
carotenoids. However, as for IPP isomerase. any such report has not been presented. Therefore, increase of carotenoid 
production by introducing a IPP isomerase gene was a new finding. 

The present invention provides a DNA chain having characteristics of increasing carotenoid production amount, 

20 and it containing the nucleotide sequence which encodes the polypeptide having the substantially same amino aid 
sequence as those of IPP isomerase. and a production method for carotenoid characterized by introducing said DNA 
chain into the carotenoid producing microorganism, culturing said transformed microorganism and increasing caroten- 
oid content in the culture broth and cells. 

The DNA chains of the present invention includes the DNA chains mentioned above (1) or (2), or the DNA chains 

25 which hybridize to said chains under stringent conditions. 

Substantially, the polypeptides encoded by the DNA chains of the present invention have the amino acid 
sequences shown in SEQUENCE ID No. 1 (A-B in Figures 4 and 5) or in SEQUENCE ID No. 2(C-D, in Figures 6 and 7). 
In the present invention, the polypeptides encoded by these DNA chains, the proteins of which amino acid sequence is 
substantially IPP isomerase, may be altered by deletion, replacement, addition and so forth of some amino acids, as 

30 long as the resulted polypeptides hold their higher carotenoid production activity. This allowance corresponds to "having 
the substantially same amino acid sequence substantially shown in SEQUENCE ID No. 1 or No. 2". As an example, a 
sequence which lacks the first amino acid(Met) can be included as the altered polypeptide or the altered enzyme. Need- 
less to say, the DNA chains of the present invention include not only the chains having the nucleotide sequences which 
encode the amino acid sequences shown in SEQUENCE ID No. 1 and 2(Figures 4 to 5), but also the degenerate iso- 

35 mers of the chains, which differs only on degenerate codons and encode the same polypeptides as the original chains 
do. 

(1) Obtaining the DNA chains 

40 One method to obtain a DNA chain having the nucleotide sequence which encodes the amino acid sequence of the 
above protein is chemical synthesis of the DNA chain at least a part of the chain according to the known nucleic acid 
synthesis method. However, considering that there are so many amino acids bound in the protein, it would be more 
preferable than chemical synthesis to make cDNA libraries of Haematococcus pluvialis or Phaffia rhodozyma or the like 
to obtain a targeted DNA chain by applying some popular method in the field of genetic engineering such as hybridiza- 

45 tion with appropriate probes. 

(2) Transformation of microorganisms such as E, coli and expression of gene 

Higher carotenoid content in culture broth or cells of microorganisms can be achieved by introducing the above 
so mentioned DNA chain of the present invention into appropriate microorganisms such as carotenoid -producing bacteria 
such as EL cpji and Zvmomonas mobilis containing carotenoid biosynthesis genes from Erwinia uredovora and so forth, 
or carotenoid-producing yeast such as Saccharomyces cerevisiae containing carotenoid biosynthesis genes from 
Erwinia uredovora and so force. 

The outline of the method to introduce exogenous genes into preferable microorganisms is mentioned below. 
55 Procedures or methods to introduce and express exogenous genes in microorganisms such as E c^i, besides 
those mentioned below in the present invention, includes those widely used in the field of genetic engineering. Those 
are applicable to the invention. See "Vectors for cloning genes", Methods in Enzymology, 216: 469-631 (1992), Aca- 
demic Press; "Other bacterial systems", Methods in Enzymology. 204: 305-636 (1991) Academic Press). 
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[LcolD 

There are some established and efficient methods to introduce exogenous genes to E coh such as Hanahan's 
method and rubidium method, and they are applicable to the present invention (See Sambrook, J., Fritsch, E. F, Mani- 

s atis, T. ( "Molecular ctoning-A laboratory manual". Cold Spring Harbor Laboratory Press (1989)). Expression of exoge- 
nous genes in E coli can be performed by known methods (See "Molecular cloning-A laboratory manual", ibid.), for 
example, vectors for E co|i such as pUC and pBluescript vectors having lac promoter can be used. The inventors of the 
present invention used pSPORTl vector or pBluescript II KS vector having lac promoter as vectors for E £0_li, and 
inserted the IPP isomerase gene, derived from Haematococcus pluvialis . Phaffia rhodozvma or Saccharomvces cere- 

10 visiae . into the lac promoter with the direction of reading through of the transcription, and expressed the gene in coli . 

[Yeast] 

There are some established methods such as the lithium method to introduce exogenous genes into Saccharomv- 
is ces cerevisiae . yeast, and such methods are applicable to the present invention (See "New biotechnology on yeast", 
Ed. Bio-industry Association (Yuichi Akiyama. editor in chief), Igaku Syuppan Center). Expression of exogenous genes 
in yeast can be performed as follows. Using both promoters and terminators, e.g. for PGK and GPP, an expression cas- 
sette is constructed by inserting the exogenous gene so that during transcription, the gene is to be read through at the 
position between the promoter and the terminator. Expression can be performed by inserting the expression cassette 
20 into a vector for £. cerevisiae such as YRp vectors (multi-copy vectors for yeast, replication starts at ARS sequence of 
yeast chromosome), YEp vectors (multi-copy vectors for yeast, replication starts at 2\im DNA) and Yip vectors (vectors 
for yeast chromosome, no starting point of replication in yeast) (See "New biotechnology on yeast", ibid.; "Genetic engi- 
neering for production of substances", Ed. Japanese Society of Agrocultural Chemistry, Asakura Publishing company; 
or Yamano, S., tshii, T, Nakagawa, M., tkenaga, H., Misawa, N., "Metabolic engineering for production of p-carotene 
25 and lycopene in Saccharomvces cerevisiae" , Biosci. Biotech, Biochem., 58: 1 1 12-1 1 14 (1994)). 

[Zymomonas mobilis] 

Introduction of exogenous genes into Zymomonas mobilis. the ethanol -producing bacterium can be performed by 
30 conjugal transfer method which is commonly used for gram negative bacteria. Expression of exogenous gene in 
Zymomonas mobilis can be performed by using pZA22 vector for this bacterium (See Katsumi Nakamura, "Molecular 
breeding of Zymomonas bacteria", Journal of the Japanese Society of Agrocultural Chemistry, 63: 1016-1018 ( 1989); 
and Misawa, N., Yamano, $., Ikenaga, H., "Production of p-carotene in Zymomonas mobilis and Aqrobacterium tume- 
faciens by introduction of the biosynthesis genes from Erwinia uredovora ". Appl. Environ. Microbiol., 57: 1847-1849 

35 (1991)). 

(3) Method to increase carotenoid production in microorganisms 

By applying the above mentioned procedures or methods for introduction and expression of exogenous genes m 

40 microorganisms, both the carotenoid synthesis genes and the IPP isomerase gene can be introduced to express, and 
microorganisms capable of producing large amount of carotenoid can be obtained. 

Farnesyl pyrophosphate (FPP) is the common substrate not only for carotenoids but also for other terpenoids such 
as sesquiterpenes, triterpenes, sterols and hopanols. In general, since microorganisms are synthesizing terpenoids 
even though they are not capable of synthesizing carotenoids, basically all of the microorganisms possesses FPP as 

45 an intermediate metabolite. On the other hand, Erwinia uredovora . the non-photosynthetic bacterium having the caro- 
tenoid synthesis genes can synthesize up to several useful carotenoids such as lycopene, p-carotene, zeaxanthin by 
using FPP as a substrate. When the genes are combined with the carotenoid synthesis genes of Agrobacterium auran- 
tiacum , the marine bacterium, up to several useful carotenoids such as cantaxanthin and astaxanthin can also be syn- 
thesized (See Figures 2 and 3). The inventors of the present invention already confirmed that by introducing cri genes 

so of Erwinia uredovora into microorganisms such as Saccharomvces cerevisiae . yeast and Zymomonas mobilis. ethanol- 
producing bacteria; these microorganisms can produce carotenoids such as p-carotene as anticipated [Yamano, S., 
Ishii, T, Nakagawa, M., Ikenaga, H., Misawa, N., "Metabolic engineering for production of p-carotene and lycopene in 
Saccharomyces cerevisiae", Biosci. Biotech, Biochem., 58:1112-1114 (1994); Misawa, N., Yamano. S., Ikenaga, H., 
"Production of p-carotene in Zymomonas mobilis and Aqrobacterium tumefaciens by introduction of the biosynthesis 

55 genes from Erwinia uredovora ", Appl. Environ. Microbiol., 57:1847-1849 (1991); and Japanese laid-open Patent Appli- 
cation No. HEI 3-58786(Japanese Patent Application filing No. HEl 2-53255):"A DNA chain useful for synthesis of car- 
otenoids" by the inventors]. 

From the above findings, it can be expected that when an appropriate combinations of the carotenoid synthesis 
genes derived from El uredovora and those from marine bacteria(typically the carotenoid synthesis genes derived from 
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Aq. aurantiacum) are introduced into the same microorganism simultaneously, as a principle, all of the microorganisms, 
in which such genes are introduced and of which introduction-expression system is established, can produce useful 
carotenoids such as astaxanthin and zeaxanthm. 

In such cases, rf the IPP isomerase gene(typically. derived from Haematococcus pluvialis . Phaffia rhodozyma and 
Saccharomvces cerevisiae) is introduced according to the above mentioned method, and is expressed concomitantly 
with the above carotenoid synthesis gene, higher production amount of useful carotenoids can be achieved. 

(4) Deoosrt of the microorganisms 

The recombinant E. coli strain JM109 has been deposited as follows with the National Institute of Bioscience and 
Human-Technology, the Agency of Industrial Science and Technology The strain contains the plasmid having the iso- 
lated gene which is the DNA chain of the invention. The names of the plasmids are shown in the parentheses. 

(i) JM109(pRH1) 

Deposit No.: FERM BP-5032 
Date of Receipt: March 6th, 1995 

(ii) JMl09(pHPl1) 
Deposit No.: FERM BP-5031 
Date of Receipt: March 6th, 1995 
(ii) JM109(pSI1) 

Deposit No.: FERM BP-5033 
Date of Receipt: March 6th. 1995 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 shows the isoprene common biosynthetic pathway from HMG-CoA to FPR 

FIGURE 2 shows the carotenoid biosynthetic pathway, and the functions of the carotenoid synthesis genes of 
Erwinia uredovora . the non-photosynthetic bacterium. 

FIGURE 3 shows the carotenoid biosynthetic pathway, and the functions of the carotenoid synthesis genes of Agro- 
bacterium aurantiacum . the marine bacterium. The solid line shows major biosynthetic pathway and the dotted line 
shows minor one. 

FIGURES 4 and 5 shows the nucleotide sequence of the IPP isomerase gene and the amino acid sequence of the 
polypeptide encoded by said gene of Phaffia rhodozyma . the astaxanthin-producing yeast. In the Figure, the sequence 
from mark A to B shows the open reading frame encoding the polypeptide consisting of 251 amino acids. 

FIGURES 6 and 7 shows the nucleotide sequence of the IPP isomerase gene and the amino acid sequence of the 
polypeptide encoded by said gene of Haematococcus pluvialis. the astaxanthin-producing green alga. In the Figure, the 
sequence from mark C to D shows the open reading frame encoding the polypeptide consisting of 259 amino acids. 

FIGURES 8 and 9 shows the nucleotide sequence of the IPP isomerase gene and the amino acid sequence of the 
polypeptide encoded by said gene of Saccharomvces cerevisiae. the yeast for laboratory use. In the Figure, the 
sequence from mark E to F shows the open reading frame encoding the polypeptide consisting of 288 amino acids. 

FIGURE 10 shows the plasmids containing the carotenoid biosynthesis genes of Erwinia uredovora . the non-pho- 
tosynthetic bacterium. 

FIGURE 1 1 shows the plasmids containing the IPP isomerase gene of Phaffia rhodozyma . Haematococcus pluvia- 
lis, or Saccharomvces cerevisiae . 

FIGURE 12 shows the growth curve in the culture broth of the lycopene producing E coli strains(L). In the Figure, 
"control" means the JL £Oji strain having no exogenous IPP isomerase gene. 

FIGURE 13 shows the lycopene production curve in the culture broth of the lycopene producing ^cjoji strains(L). 
In the Figure, "control" means the E £o_li strain having no exogenous IPP isomerase gene. 

FIGURE 14 shows production of lycopene(L:), p-carotene(p:) and phytoene(P:) in the cultured cells of the EL coli 
strains. In the Figure, "control" means the E cgji strain having no exogenous IPP isomerase gene. 

EXAMPLE 

The following examples illustrate the present invention in more detail, however, the present invention is not limited 
to them. The genetic recombination experiments used here are based on the standard methods(Sambrook. J., Fritsch, 
E. F. Maniatis. T, "Molecular cloning-A laboratory manual", Cold Spring Harbor Laboratory Press (1989)) unless oth- 
erwise stated. 
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(EXAMPLE 1) Biological materials and culture conditions 

Phaffia rhodozvma ATCC 24230 Strain(Astaxanthtn-producing yeast) registered at the American Type Culture Col- 
lection(ATCC) is used. YM media(yeast extract 0.3%. malt extract 0.3%, bactopeptone 0.5%, Glucose 1%) is used for 
Ph. rhodozyma . Haematococcus pluvialis . the astaxanthin-producing green alga. NIES-144 strain registered at the Glo- 
bal Environmental Forum is used. Ha, Pluvialis is cultured at 20°C for about 4 days in basic culture media(yeast extract 
0.2%, sodium acetate 0. 12%, L-asparagin 0. 04%, magnesium chloride hexahydrate 0. 02%. ferrous sulfate heptahy- 
drate 0.001%. calcium chloride dihydrate 0.002%) under 12 hr light (20 nE/m 2 s)/12 hr dark condition. Furthermore, in 
order to induce astaxanthin synthesis in Ha^ pluvialis. cyst formation, a kind of differentiation, has to be induced. To 
induce cyst formation, both acetic acid 45 mM and ferrous sulfate heptahydrate 450 nM at final concentrations are 
added. pluvialis in the media is cultured for about 12 hr at 20°C with light(125 ME/m 2 s). Saccharomvces cerevi- 
siae(Yeast for laboratory use) S288C strain registered at the Yeast Genetic Stock Center is used. For £a cerevisiae . 
YPD media(yeast extract 1%, bactopeptone 2%. glucose 2%) is used. 

(EXAMPLE 2) Preparation of whole RNA in Phaffia rhodozvma 

Phaffia rhodozvma ATCC 24230 strain is cultured with shaking for approx. 24 hr at 20°C in 400 ml of YM media. 
When the turbidity of the media reached at OD 600 = 0.4. the bacteria are collected and frozen in liquid nitrogen. The 
frozen bacteria are stored in the freezer at -80^ and used for preparing total RNA. After thawing the frozen bacteria in 
a tube on ice. the bacteria are suspended in 6 ml of ANE buffer(l0 mM sodium acetate. 100 mM sodium chloride, 1 mM 
EDTA, pH 6.0). Glass beads are added to cover the surface of the bacteria layer. Then, 600 \i\ of 10% SDS and 6 ml of 
phenol prewarmed at 65°C are added. The suspension is kept at 65°C for 5 minutes, and the tube is vortexed to 
crushed cell membranes at every 30 seconds. Then, the suspension is rapidly cooled down to room temperature and 
centrifuged for 10 minutes at 1,500 x g at room temperature. Equal volume of phenol is added to the supernatant and 
vortex for 2 minutes. Then the suspension was centrifuged for 10 minutes at 1,500 x g at room temperature. Then, by 
using equal volume of phenol/chloroform(1/1 (v/v)) and chloroform alone, the same procedures as above are performed. 
To the resulted supernatant, one tenth volume of 3 M sodium acetate and three volume of ethanol are added; then the 
supernatant is stored in the freezer at -20°C for 30 minutes. The supernatant is centrifuged for 15 minutes at 15,000 x 
g at 4°C, a pellet is rinsed with 70% ethanol and dried. The residual is dissolved in 200 p.! of sterilized water to make 
total RNA solution of Ph_ rhodozvma . By this preparation procedure, 1 .6 mg of total RNA is obtained. 

(EXAMPLE 3) Preparation of whole RNA in Haematococcus pluvialis 

Haematococcus pluvialis NIES-144 strain is cultured for approx. 4 days in 800 ml of the basic culture media under 
the condition of 20°C. light intensity at 20 nE/m 2 s and 12 hr light/12 hr dark cycle. Then, both acetic acid 45 mM and 
ferrous sulfate heptahydrate 450 fiM as final concentrations are added. The j± pluvialis in the media is cultured for 
approx. 12 hr at 20°C with light (125 nE/m 2 s). The bacteria are collected from the media, frozen in liquid nitrogen arid 
crushed in the mortar to give powder. Then, three ml of ISOGEN-LS[Nippon Gene K.K.] is added to the powder and 
stand for 5 minutes. Then 0.8 ml of chloroform is added, and the solution is stirred vigorously for 1 5 seconds and stand 
at room temperature for 3 minutes. The solution is centrifuged for 15 minutes at 4°C, 12,000 x g, two ml of isopropanol 
is added to the supernatant and the supernatant is stood at room temperature for 10 minutes. Then, the solution is cen- 
trifuged for 10 minutes at 4° C, 12.000 x g. The resulted pellet is rinsed with 70% ethanol to dry. After drying, the residual 
is dissolved in 1 mi of TE buffer(10 mM Tris-HCI pH 8.0, 1 mM EDTA) to make total RNA solution of Ha pluvialis . By 
this preparation procedure, 4.1 mg of whole RNA was obtained. 

(EXAMPLE 4) Establishing cDNA expression libraries of Phaffia rhodozvma and Haematococcus pluvialis 

By using Oligotex<JT30 Super[Takara Syuzo K.K.], poly A + RNA from Phaffia rhodozyma and Haematococcus plu- 
vialis are purified from approx. 1 mg total RNA respectively. The purification is performed according to the methods 
mentioned in the package insert. By following the method, approx. 26 \xg of poly A + mRNA from P£l rhodozvma and 
approx. 14 ng of it from Ha, pluvialis are purified. 

Preparation of cDNA is performed with Superscript™ plasmid system(GIBCO BRL) by the method mentioned in the 
package insert with some modifications. Approx. 5 ng of poly A + mRNA is used. A synthetic DNA consisting of the rec- 
ognition sequence for the restriction enzyme Noil and 15 mers oligo-dT is used as a primer. The complementary DNA 
is synthesized with reverse transcriptase, SUPERSCRIPT RT. Then, by using Escherichia coli DNA ligase, E. co_N DNA 
polymerase and E £pi RNase H, double strand DNA is synthesized. Then, the linker of the restriction enzyme £ajl is 
bound by using T4 DNA ligase. cDNA is designed to have the £all site at the upstream terminal of itself and the Not l 
site at the downstream of poly A. Fractionation by size of these cDNAs is performed by electrophoresis and the fractions 
ranging from 0.7 kb to 3.5 kb are collected. cDNA in the collected fractions is ligated to cDNA expression vector 
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pSPORT I NQll-Sall-Cut by using both the ligation buffer which is included in the kit. 50 mM Tris-HCI pH 7.6, 10 mM 
MgC! 2 . 1 mM ATP, 1 mM DTT, 5% PEG 8,000 and T4 DNA Ugase. The cDNA expression vector pSPORT I has lac pro- 
moter at the upstream of the Sail site and can express cDNA in fL CQli- Then, by using whole the ligated DNA solution, 
transformation of the competent cells of coil DH5a prepared is performed according to the method described in 
"Molecular Cloning 2nd edition: Cold Spring Harbor Laboratory, 1.21-1.41(1989). Approx. 200.000 transformed strains 
of P£l rhodozvma and approx. 40,000 transformed strains of pluvialis are obtained. After collecting all of the trans* 
formants. the plasmid DNA is prepared according to the method described in "Molecular Cloning 2nd edition, ibid." As 
a result. 0.9 mg and 0.6 mg of plasmid DNAs are obtained respectively and these are assigned as cDNA libraries of Ph. 
rhodozvma and Ha. pluvialis . 

(EXAMPLE 5) Preparation of carotenoid -producing CQli 

The plasmid pCARl6(Misawa, N., Nakagawa. M., Kobayashi. K.. Yamano, S. ( Izawa, Y, Nakamura, K.. Harashima, 
K., "Elucidation of the Erwinia uredovora carotenoid biosynthetic pathway by functional analysis of gene products 
expressed in Escherichia qqU", J. Bacteriol.. 172:p.6704-67l2 (1990) and Japanese Patent Application laid-open No. 
HEI 3-58786 (Japanese Patent Application filing No. HEI 2-53255): "A DNA chain useful for synthesis of carotenoids" 
by the present inventors) having the carotenoid synthesis genes except for £rj2 derived from Erwinia uredovora . is 
digested with BsjEII, treated with Klenow enzyme and religated to inactivate the £rtX gene by frame shift. After that, the 
6 0 kb Asp7l8 (Kpn l)-EcoRI fragment containing qtjE, £rjB, grJI and crjY genes necessary for p-carotene production is 
taken out. The fragment is then inserted into the EcqRV sites of the E. cfili vector pACYCl84 and the desirable plas- 
mid(named pACCARl6AcrtX, FIGURE 10) is obtained. E. coli containing this plasmid (pACCARl6AcrtX) is chloram- 
phenicol resistant and has yellowish color due to p-carotene production. 

Then, the plasmid pCARl6 is digested with BslE!l/£naBI, treated with Klenow enzyme and religated to remove the 
2.26 kb BsjEII-£naBI fragment containing £rjX and £iY genes. After that, the 3.75 kb Asp 7l8(Kpnl)-EcoRI fragment 
containing crjE, £rjB and crjl genes necessary for lycopene production is taken out. The fragment is then inserted into 
the E£fiRV sites of the vector pACYCl84 and the desirable plasmid(named pACCRT-EIB, FIGURE 10) is 

obtained. E, coji containing pACCRT-EIB is chloramphenicol resistant and has reddish color due to lycopene production 
(Cunningham Jr., F X., Chamovitz, D., Misawa, N., Gatt. E., Hirschberf, J.. "Cloning and functional expression in 
Escherichia soli of a cyanobacterial gene for lycopene cyclase, the enzyme that catalyzes the biosynthesis of p-caro- 
tene", FEBS Lett., 328: 130-138 (1993)). 

Then, the plasmid pCARl6 is digested with BstEll/Eco52l. treated with Klenow enzyme and religated to remove 
the 3.7 kb BstEII-Eco52l fragment containing crtX, £rtY and crtl genes. After that, the 2.3 kb Aso718 fKpn h-EcoRI frag- 
ment containing QrtE and £rjB genes(FIGURE 2) necessary for phytoene production is taken out. The fragment is then 
inserted into the Ei&RV sites of the qqH vector pACYCl 84 and the decibel plasmidfnamed pACCRT-EB, FIGURE 1 0) 
is obtained. EL £Q!i containing pACCRT-EB is chloramphenicol resistant and does not show color change as phytoene 
is colorless (Linden, H,, Misawa, N., Chamovitz, D., Pecker, I., Hirschberg, J., Sandmann, G.. "Functional complemen- 
tation in Escherichia qq\\ of different phytoene desaturase genes and analysis of accumulated carotenes", Z. Naturfor- 
sch. 46c: 1045-1051 (1991)). 

(EXAMPLE 6) Screening of genes that increase p-carotene production 

As the E cq!i strain JM101 containing the above plasmid pACCARl 6AcrtX shows yellowish color due to p-carotene 
production, it was investigated whether more yellowish transformant can be obtained by introducing cDNA expression 
library of Phaffia rhodozvma or Haematococcus pluvialis . As a first step, competent cells of E, qqU JM101 containing 
pACCARl6AcrtX are prepared according to the method described in "Molecular cloning 2nd edition: Cold Spring Har- 
bor Laboratory. 1 .21 -1 .41(1989). Then, one hundred ng of each cDNA expression library of Ph. rhodozvma and Ha. plu- 
vialis is introduced to 1 ml of the competent cells. Approx. 200,000 transformants of Ph. rhodozvma and approx. 40,000 
transfer mants of Ha. pluvialis are obtained and inoculated for screening on the LB plate(bactotrypton 1%, yeast extract 
0.5%. NaCI 1%, agar 1.5%) containing 150 fxg/ml of ampicillin, 30 ng/ml of chloramphenicol and 1 mM of IPTG. From 
the screening, 5 strains of PTv rhodozvma and 1 0 strains of Ha pluvialis shows deep yellowish color than other strains 
and they are isolated. The plasmid DNA extracted from these strains is subject to restriction enzyme analysis, and it 
was found that the plasmids from the five strains and ten strains have common DNA fragment respectively. Of these 
screened plasmids derived from the cDNA expression libraries, a plasmid from Ptr rhodozvma was named pRH1 (Fig- 
ure 1 1 ) and another plasmid from Ha^ pluvialis was named pHP1 . In addition to that, a fragment is taken out after digest- 
ing pHP1 with £ajl and Ngil, and then, the fragment is inserted into pBluescript KS+. The resulted plasmid was named 
pHPl 1 (FIGURE 1 1 ) and was used for the experiments mentioned below. 
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(EXAMPLE 7) Nucleotide sequence determination on the gene that increases p-carotene production 

From the plasmids pRH1 and pHPl, the deletion plasmids which lack various tengths from the original plasmids 
are prepared by the following procedures. By using those deletion plasmids, the nucleotide sequences are determined. 

5 Decomposition of pRP1 is performed with both EcoRI and PsJI, or with both NoJI and Sph l. Decomposition of pHP1 is 
performed with both Aalll and Bam HI. or with both Kpjnl and EcqRI. After extraction with phenol/chloroform, DNA is 
recovered by ethanol precipitation. Each DNA fraction is then dissolved in 100 \x\ portions of Exolll buffer(50mM Tris- 
HCI, 100mM NaCI, 5mM MgCI 2 . 10mM 2-mercaptoethanol, pH 8.0) and is kept at 37°C after addition or 180 units of 
Exolll nuclease. Ten \l\ portions of the solution are sampled every 30 seconds and transferred to tubes containing 10 

w of MB buffer(40 mM NaCI, 2 mM 2nCI 2 . 10% glycerol. pH 4.5) in an ice bath. After sampling, the 10 tubes are kept at 
65°C for 10 minutes to inactivate the enzyme. Then, 5 units of mung bean nuclease is added and kept at 37°C for 30 
minutes. From one original plasmid, ten different kind of DNA fragments are recovered by agarose gel electrophoresis. 
The degree of deletion of each fragment varies. The terminals of the recovered DNAs are smoothed with Klenow 
enzyme to subject to ligation reaction at 16°C overnight, and by using resulting DNA, £L coji DH5a is transformed to 

is obtain clones. The plasmids are prepared from the various clones obtained, and nucleotide sequences are determined 
by using luminescence primer cycle sequence kit(Applied Biosystems corp.) with an automatic sequencer. 

As a result, it was found that the nucleotide sequence of the cDNA in pRH1 derived from Phaffia rhodozvma con- 
sists of 1 .099 base pairs (SEQUENCE ID No. 4), and there is an open reading frame which encodes a polypeptide hav- 
ing 251 amino acids (which corresponds the region from A to B in Figures 4 and 5). It was also found that the nucleotide 

20 sequence of the cDNA in pHP1 derived from Haematococcus pluvialis consists of 1 ,074 base pairs (SEQUENCE ID 
No. 5), and there is an open reading frame which encodes a polypeptide having 259 amino acids (which corresponds 
the region from C to D in Figures 6 and 7). The amino acid sequences expected from these open reading frames are 
investigated by analyzing homology in the Gene Bank Both of the amino acid sequences of Pfr rhodozvma and Ha. 
pluvialis have significant homology with the IPP isomerase gene of Saccharomvces cerevisiae , 27.0% for Pfv rho: 

25 dozvma and 20.3% for Ha, pluvialis . Therefore the genes were identified as the IPP isomerase gene. 

(EXAMPLE 8) Preparation of total DNA in Saccharomvces cerevisiae 

Preparation of total DNA in Saccharomvces cerevisiae is performed according to the method described in "Meth- 
30 ods in Yeast Genetics; a laboratory course manual: Cold Spring Harbor Laboratory, p.1 31 -132(1 990). Sa cerevisiae 
S288C strain is inoculated in 10 ml of YPD media and cultured at 30°C overnight. The cultured cells are collected and 
suspended in 0.5 ml of sterilized water for washing. By discarding the supernatant, the yeast are collected again. A 0.2 
ml of the mixture(2% Triton X-100, 1% SDS. 100 mM NaCI, 10 mM Tris-HCI(pH 8), 1mM EDTA), 0.2 ml of phenol/chlo- 
roform/isoamylalcohol (25/24/1 (v/v/v)) and 0.3 g of glass beads are added. After vortex mix for 3-4 minutes, two hun- 
35 dred ul of TE buffer(10 mM Tris-HCI(pH 8), 1 mM EDTA) is added. Then the solution is centrifuged for 5 minutes, and 
the supernatant is transferred to another tube and 1 ml of ethanol is added. Then the solution is centrifuged again for 2 
minutes. The resulted pellet is dissolved in 0 4 ml of TE buffer. Then, two ^l of RNase A(10 mg/ml) is added and the 
solution is stood for 5 minutes at 37°C. Then, ten |il of 4 M ammonium acetate and 1 ml of ethanol are added. After 
mixing well, the solution is centrifuged for 2 minutes and the resulted pellet is recovered. After drying the pellet, it was 
40 dissolved with 50 \x\ of TE buffer to have total DNA of cerevisiae S288C strain. By this preparation procedure, 3.4 \xg 
of total DNA was obtained. 

(EXAMPLE 9) Isolation of the IPP isomerase gene of Saccharomyces cerevisiae by PCR method 

45 Based on the nucleotide sequence of the IPP isomerase gene of S_ cerevisiae reported in the aforementioned ref- 
erence(Anderson, M. S., Muehlbacher, M., Street, I P, Profitt, J., Poulter, C. D., "Isopentenyl diphosphate: dimethylallyl 
diphosphate isomerase - an improved purification erf the enzyme and isolation of the gene from Saccharomyces cere- 
visiae ". J. Biol. Chem., 264:19169-19175(1989)), the primers below were synthesized. 
Primer No. 1 S'-TCGATGGGGGTTGCCTTTCTTTTTCGG-S' 

so Primer No. 2 5*-CGCGTTGTTATAGCATTCTATGAATTTGCC-3' 

The procedure was designed to obtain PCR amplified IPP isomerase gene having Tagl sites at the upstream ter- 
minal and Acc ll region at the downstream terminal. Thirty cycles of PCR is performed with 200 ng of total DNA of 3^ 
cerevisiae and PfuDNA polymerase (STRATAGENE). To express the IPP isomerase gene obtained by PCR in E. coli. 
it is digested with both T£gl and Accll. Then, the gene was inserted into Clal sites and Sma l sites of pBluescript KS+ 

55 vector. The resulted plasmid was named pSI1 (Figure 11). This DNA derived from ^ cerevisiae had a nucleotide 
sequence consisting of 1,058 bp (SEQUENCE ID No. 6), and contained a gene which encodes IPP isomerase consist- 
ing of 288 amino acids(corresponds from E to F in Figures 8 and 9). 
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(EXAMPLE 10) Increase of lycopene production amount by introducing the IPP isomerase gene 

Into the lycopene-producing cqH JM101 strain (abbreviated as L hereafter) which contains pACCRT-EIB(Figure 
10). pSPORTl vector, pRH1 plasmid containing the IPP isomerase gene of Phaffia rhodozvma . pHP1 1 plasmid con- 

5 tainmg the IPP isomerase gene of Haematococcus pluvialis or pSM plasmid containing the IPP isomerase gene of Sac- 
charomvces cerevisiae fFIGURE 1 1 ) are introduced respectively. These £jH transformants are then plated on the LB 
plate containing 150 ng/ml of ampicillin(Ap), 30 jig/ml of chloramphenicol(Cm) and 1 mM of IPTG. and cultured at 28'C 
overnight. The three strains, in which each IPP isomerase gene were introduced, showed deep reddish color due to lyc- 
opene production compared with the control (lycopene-producing E.coli) in which only vector is introduced. Further- 

io more, growth rate of the three strains on agar plates were faster than the control strains and they always showed larger 
colonies than those of the control during culture. It is considered that due to introduction and expression of the IPP iso- 
merase gene, the upstream of the biosynthetic pathway up to FPP became more efficient see FIGURE 1), and conse- 
quently, increase of FPP supply led to increase of lycopene. As for faster growth rate, it is also considered that due to 
increase of FPP. sufficient amount of the substrate can be supplied not only for lycopene production but also for the pro- 

T5 duction of other membrane components derived from FPP. that is, FPP or GGPP binding protein, and these compo- 
nents are necessary for growth of £L qqU. 

Increase of lycopene production amount by E.coli carrying the IPP isomerase gene is also confirmed by liquid cul- 
ture After overnight shaking culture of the LB media(5 ml) containing both Ap and Cm at 28°C, 2 ml of the media is 
taken and transferred to 200 ml of 2YT culture media(1 .6% bactotrypton, 1% yeast extract. 0.5% NaCI) containing Ap, 

20 Cp and 0.1 mM IPTG. and shaking culture is performed at 230 rpm, 28°C. Five ml each of the media is sampled several 
hours' intervals to determine growth rate and lycopene content. Growth rate is calculated from absorbance at 650 nm. 
Lycopene content is determined according to the following procedure. The cells collected by centrifugation, 2.5 ml of 
acetone is added to the cells and stand for 30 minutes. Vortex mix once in a while. After filtration, absorbance at 474 
nm is measured to determine the lycopene content based on the absorbance 185.0 for 1 mM lycopene (light path: 1 

25 cm). JASCO UVIDEC-220B spectrophotometer is used. By using HPLC, it is confirmed that these strains actually pro- 
duced lycopene and absorbance at 474 nm is attributable to lycopene. HPLC conditions are mentioned in EXAMPLE 
11. The results are shown in Figure 12(growth curve) and Figure 13(lycopene production curve). As for the growth 
rate(Figure 12), there is no difference among any the strains including the control strains. This result is different from 
that obtained from culture plates. Probably, when the liquid culture is performed, even in the control strain which does 

30 not have exogenous IPP isomerase gene can grow rapidly, because the supply of the substrate for membrane compo- 
nents such as FPP and GGPP binding protein is enough compared to agar culture is done. In contrast, there is a big 
difference between the control strain having no exogenous IPP isomerase gene and the three exogenous IPP isomer- 
ase gene-carrying strains. During culture, the three strains always showed several times higher lycopene production 
amount compared with the control strain. Lycopene production amount per qq\\ dry weight at 28 hr after the start of 

35 the culture is shown in Figure 14. The three strains containing the IPP isomerase gene showed 3.6-4.5 times higher 
production than the control strain. Lycopene-producing E coli containing pHP1 1 is able to produce 1.03 mg lycopene 
per 1g dry weight. 

(EXAMPLE 11) Increase of p-carotene production amount by introducing the IPP isomerase gene 

40 

Into the p-carotene producing E_ qqU JM101 strain (abbreviated as p hereafter) which contains 
pACCAR16AcrtX(FIGURE 10), either pSPORTl vector orpRHI plasmid containing the IPP isomerase gene of Phaffia 
rhodozvma is introduced separately. After overnight shaking culture of the LB media(5 ml) containing both Ap and Cm 
at 28°C, 1 ml of the media is taken and transferred to 100 ml of 2YT media containing Ap, Cm and 0.1 mM IPTG, and 

45 shaking culture is performed at 230 rpm at 28°C for 28 hr. The bacteria are collected by centrrfugation and washed with 
0.85% NaCI. After washing, the bacteria are suspended in 40 ml of acetone and allowed to stand for 30 minutes. Vortex 
mix once in a while. After filtration, absorbance at 454 nm is measured to determine p-carotene content based on the 
absorbance 134.4 for 1 mM p-carotene (light path: 1 cm). The result is shown in FIGURE 14. p-Carotene producing E 
coli containing pRH1 produced 709 u.g of p-carotene per 1 g dry weight. This amount is 1 .5 times higher than the control. 

so By using HPLC on the above acetone extract, it is confirmed that these strains actually produced p-carotene and 
absorbance at 454 nm is attributable to p-carotene. Novapack HR 6^ C18(3.9 x 300 mm, Waters) is used as a column. 
Acetonitrile/methanol/2-propanol(90/6/4(v/v/v)) is used as an elution solvent. A photodiode array detector 996(Waters) 
is used to monitor an elution profile. The results showed that almost 100% of a peak appeared in a visible spectrum is 
p-carotene. As the p-carotene standard preparation, chemically synthesized p-carotene (Sigma) is used. 

55 

(EXAMPLE 12) Increase of phytoene production amount by introducing the IPP isomerase gene 

Into the phytoene producing JM1 01 strain (abbreviated as P hereafter) which contains pACCRT-EB(FIGURE 
10), any of pSPORTl vector , pRH1 plasmid containing the IPP isomerase gene of Phaffia rhodozyma or pHP1 1 plas- 
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mid containing the IPP isomerase gene of Haematococcus pluvialis is introduced separately. After overnight shaking 
culture of the LB media(5 ml) containing both Ap and Cm at 28°C, 1 ml of the media is taken and transferred to 100 ml 
of 2YT media containing Ap, Cm and 0.1 mM IPTG, and shaking culture is performed at 230 rpm at 28°C for 28 nr. The 
bacteria are collected by centrifugation and washed with 0.85% NaCI. After washing, the bacteria are suspended in 40 

5 ml of acetone and allowed to stand for 30 minutes. Vortex mix once in a while. After filtration and drying by rotary evap- 
orator, partition is performed with 40 ml of petroleum ether and water. Absorbance of the ether layer at 286 nm is meas- 
ured to determine phytoene content based on the absorbance 41.2 for 1 mM phytoene (light path: 1 cm). As HPLC 
analysis described in EXAMPLE 1 1 showed that 70% of the absorbance at 286 nm is attributable to phytoene, an and 
also actual phytoene content is adjusted to 70% of the above value. The result is shown in FIGURE 14. Phytoene-pro- 

w ducing E Qcji containing the IPP isomerase gene produced 1 .7-2.1 times higher phytoene than control strain. 

From the above examples, we showed that by introducing the IPP isomerase gene into p-carotene, lycopene or 
phytoene-producing E ecji, several times higher carotenoid production is actually achieved. It is considered that due to 
introduction and expression of the IPP isomerase gene, upstream of the biosynthetic pathway up to FPP became more 
effictent(see FIGURE 1), and consequently, increase of FPP supply led to increase of these carotenoids. Therefore, it 

15 is considered that this findings can be applicable not only for p-carotene, lycopene and phytoene productions but also 
for all other carotenoids such as astaxanthin and zeaxanthin. 

The present invention provides a DNA chain which can significantly increase carotenoid production in biosynthesis 
of carotenoid by microorganisms and a method to obtain several times higher carotenoid production amount by intro- 
ducing and expressing said DNA chain into carotenoid-producing microorganisms. It is expected that said DNA chain 

20 can be applicable to increase production in microorganisms not only for carotenoids but also for terpenoids and so forth 
which require same substrate(FPP) as carotenoids. 
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[ SEQUENCE LISTING] 
SEQUENCE ID No. : 1 
LENGTH: 251 

SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULAR TYPE: peptide 
SEQUENCE: 

Met Ser Met Pro Asn lie val Pro Pro Ala Glu Val Arg Thr Glu Gly 

5 10 15 

Leu Ser Leu Glu Glu Tyr Asp Glu Glu Gin Val Arg Leu Met Glu Glu 

20 25 30 

Arg Cys He Leu Val Asn Pro Asp Asp Val Ala Tyr Gly Glu Ala Ser 

35 40 45 

Lys Lys Thr Cys His Leu Met Ser Asp He Asn Ala Pro Lys Asp Leu 

50 55 60 

Leu His Arg Ala Phe Ser Val Phe Leu Phe Arg Pro Ser Asp Gly Ala 
65 70 75 80 

Leu Leu Leu Gin Arg Arg Ala Asp Glu Lys He Thr Phe Pro Gly Met 
85 90 95 

30 Trp Thr Asn Thr Cys Cys Ser His Pro Leu Ser He Lys Gly Glu Val 

100 105 110 

Glu Glu Glu Asn Gin He Gly Val Arg Arg Ala Ala Ser Arg Lys Leu 

115 120 125 

35 Glu His Glu Leu Gly Val Pro Thr Ser Ser Thr Pro Pro Asp Ser Phe 

130 135 140 

Thr Tyr Leu Thr Arg He His Tyr Leu Ala Pro Ser Asp Gly Leu Trp 
145 150 155 160 

Gly Glu His Glu He Asp Tyr lie Leu Phe Ser Thr Thr Pro Thr Glu 

165 170 175 

His Thr Gly Asn Pro Asn Glu Val Ser Asp Thr Arg Tyr Val Thr Lys 

180 185 190 

Pro Glu Leu Gin Ala Met Phe Glu Asp Glu Ser Asn Ser Phe Thr Pro 

195 200 205 

Trp Phe Lys Leu He Ala Arg Asp Phe Leu Phe Gly Trp Trp Asp Gin 

210 215 220 

Leu Leu Ala Arg Arg Asn Glu Lys Gly Glu Val Asp Ala Lys Ser Leu 
225 230 235 240 



40 



45 



50 



55 



BNSOOCID- <EP 076955 1A1 l> 



12 



EP 0 769 551 A1 



Glu Asp Leu Ser Asp Asn Lys Val Trp Lys Met *** 
245 250 

5 



15 



20 



25 



SEQUENCE ID NO-: 2 
LENGTH: 25 9 

SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULAR TYPE: peptide 
SEQUENCE: 

Met Gin Leu Leu Ala Glu Asp Arg Thr Asp His Met Arg Gly Ala Ser 

5 10 15 

Thr Trp Ala Gly Gly Gin Ser Gin Asp Glu Leu Met Leu Lys Asp Glu 

20 25 30 

Cys lie Leu Val Asp Ala Asp Asp Asn He Thr Gly His Val Ser Lys 

35 40 45 

Leu Glu Cys His Lys Phe Leu Pro His Gin Pro Ala Gly Leu Leu His 

50 55 60 

Arg Ala Phe Ser Val Phe Leu Phe Asp Asp Gin Gly Arg Leu Leu Leu 
65 70 75 80 

Gin Gin Arg Ala Arg Ser Lys He Thr Phe Pro Ser Val Trp Thr Asn 

85 90 95 

Thr Cys Cys Ser His Pro Leu His Gly Gin Thr Pro Asp Glu Val Asp 

100 105 110 

Gin Leu Ser Gin Val Ala Asp Gly Thr Val Pro Gly Ala Lys Ala Ala 
115 120 125 

40 Ala He Arg Lys Leu Glu His Glu Leu Gly He Pro Ala His Gin Leu 

130 135 140 

Pro Ala Ser Ala Phe Arg Phe Leu Thr Arg Leu His Tyr Cys Ala Ala 
145 150 155 160 

45 Asp Val Gin Pro Ala Ala Thr Gin Ser Ala Leu Trp Gly Glu His Glu 

165 170 175 

Met Asp Tyr He Leu Phe He Arg Ala Asn Val Thr Leu Ala Pro Asn 

180 185 190 

Pro Asp Glu Val Asp Glu Val Arg Tyr Val Thr Gin Glu Glu Leu Arg 
195 200 205 
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Gin Met Met Gin Pro Asp Asn Gly Leu Glu Trp Ser Pro lip i-he nrg 

210 215 220 

5 He He Ala Ala Arg Phe Leu Glu Arg Trp Trp Ala Asp Leu Asp Ala 

225 230 235 240 

Ala Leu Asn Thr Asp Lys His Glu Asp Trp Gly Thr Val His His He 

245 250 255 

to Asn Glu Ala *** 



15 SEQUENCE ID NO.: 3 

LENGTH: 288 

SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULAR TYPE: peptide 
SEQUENCE: 

Met Thr Ala Asp Asn Asn Ser Met Pro His Gly Ala Val Ser Ser Tyr 

5 10 15 

Ala Lys Leu Val Gin Asn Gin Thr Pro Glu Asp He Leu Glu Glu Phe 

20 25 30 

Pro Glu He He Pro Leu Gin Gin Arg Pro Asn Thr Arg Ser Ser Glu 

35 40 45 

Thr Ser Asn Asp Glu Ser Gly Glu Thr Cys Phe Ser Gly His Asp Glu 

50 55 60 

Glu Gin He Lys Leu Met Asn Glu Asn Cys He Val Leu Asp Trp Asp 
65 70 75 80 

Asp Asn Ala He Gly Ala Gly Thr Lys Lys Val Cys His Leu Met Glu 

85 90 95 

Asn He Glu Lys Gly Leu Leu His Arg Ala Phe Ser Val Phe He Phe 

100 105 110 

Asn Glu Gin Gly Glu Leu Leu Leu Gin Gin Arg Ala Thr Glu Lys He 
45 115 120 125 

Thr Phe Pro Asp Leu Trp Thr Asn Thr Cys Cys Ser His Pro Leu Cys 

130 135 140 

He Asp Asp Glu Leu Gly Leu Lys Gly Lys Leu Asp Asp Lys lie Lys 
145 150 155 160 

Gly Ala He Thr Ala Ala Val Arg Lys Leu Asp His Glu Leu Gly He 
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50 



55 
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Pro Glu Asp 

5 

lie His Tyr 
195 

Asp Tyr lie 
10 210 

Asn Pro Asn 

225 

Asp Leu Lys 

is 

Phe Lys lie 

2Q Asp Asp Leu 

275 

* * * 

25 

SEQUENCE ID No. : 4 
LENGTH: 109 9 

30 

SEQUENCE TYPE: nucleic acid 
STRANDNESS: double 
TOPOLOGY: linear 

35 

MOLECULAR TYPE: CDNA 
ORIGIN 

4Q ORGANISM: Phaffia rhodozyma 

STRAIN: ATCC 242 30 
SEQUENCE CHARACTERISTIC 
45 CHARACTERISTIC CODE: CDS 

LOCATIION: 99.. 851 

DETERMINATION METHOD: E 

50 SEQUENCE: 

CCCACGCGTC CGCACATCTC GCATATATCA CTTTCCTCCT TCCAGAACAA GTTCTGAGTC 60 



165 

Glu Thr Lys Thr Arg Gly 
180 185 
Met Ala Pro Ser Asn Glu 

200 

Leu Phe Tyr Lys lie Asn 
215 

Val Asn Glu Val Arg Asp 
230 

Thr Met Phe Ala Asp Pro 
245 

lie Cys Glu Asn Tyr Leu 
260 265 
Ser Glu Val Glu Asn Asp 

280 



170 175 
Lys Phe His Phe Leu Asn Arg 
190 

Pro Trp Gly Glu His Glu lie 
205 

Ala Lys Glu Asn Leu Thr Val 
220 

Phe Lys Trp Val Ser Pro Asn 
235 240 
Ser Tyr Lys Phe Thr Pro Trp 
250 255 
Phe Asn Trp Trp Glu Gin Leu 
270 

Arg Gin lie His Arg Met Leu 
285 
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20 



AACCGAAAAG AAAGAAGGCA GAGGAAAATA TATTCTAG ATG TCC ATG CCC AAC ATT 116 

Met Ser Met Pro Asn lie 
5 

GTT CCC CCC GCC GAG GTC CGA ACC GAA GGA CTC AGT TTA GAA GAG TAC 164 
val Pro Pro Ala Glu val Arg Thr Glu Gly Leu Ser Leu Glu Glu Tyr 

10 15 20 

GAT GAG GAG CAG GTC AGG CTG ATG GAG GAG CGA TGT ATT CTT GTT AAC 212 
Asp Glu Glu Gin Val Arg Leu Met Glu Glu Arg Cys lie Leu Val Asn 

25 30 35 

CCG GAC GAT GTG GCC TAT GGA GAG GCT TCG AAA AAG ACC TGC CAC TTG 2 60 
Pro Asp Asp Val Ala Tyr Gly Glu Ala Ser Lys Lys Thr Cys His Leu 

40 45 50 

ATG TCC AAC ATC AAC GCG CCC AAG GAC CTC CTC CAC CGA GCA TTC TCC 308 
Met Ser Asn lie Asn Ala Pro Lys Asp Leu Leu His Arg Ala Phe Ser 
55 60 65 70 

GTG TTT CTC TTC CGC CCA TCG GAC GGA GCA CTC CTG CTT CAG CGA AGA 356 
Val Phe Leu Phe Arg Pro Ser Asp Gly Ala Leu Leu Leu Gin Arg Arg 
25 75 80 85 

GCG GAC GAG AAG ATT ACG TTC CCT GGA ATG TGG ACC AAC ACG TGT TGC 404 
Ala Asp Glu Lys lie Thr Phe Pro Gly Met Trp Thr Asn Thr Cys Cys 
90 95 100 

30 AGT CAT CCT TTG AGC ATC AAG GGC GAG GTT GAA GAG GAG AAC CAG ATC 452 

Ser His Pro Leu Ser lie Lys Gly Glu Val Glu Glu Glu Asn Gin He 

105 110 115 

GGT GTT CGA CGA GCT GCG TCC CGA AAG TTG GAG CAC GAG CTT GGC GTG 500 
35 Gly val Arg Arg Ala Ala Ser Arg Lys Leu Glu His Glu Leu Gly Val 

120 125 130 

CCT ACA TCG TCG ACT CCG CCC GAC TCG TTC ACC TAC CTC ACT AGG ATA 54 8 
Pro Thr Ser Ser Thr Pro Pro Asp Ser Phe Thr Tyr Leu Thr Arq lie 

40 

135 140 145 150 

CAT TAC CTC GCT CCG AGT GAC GGA CTC TGG GGA GAA CAC GAG ATC GAC 596 

His Tyr Leu Ala Pro Ser Asp Gly Leu Trp Gly Glu His Glu He Asp 

45 155 1 60 1 65 

TAC ATT CTC TTC TCA ACC ACA CCT ACA GAA CAC ACT GGA AAC CCT AAC 64 4 
Tyr He Leu Phe Ser Thr Thr Pro Thr Glu His Thr Gly Asn Pro Asn 
170 175 180 

50 GAA GTC TCT GAC ACT CGA TAT GTC ACC AAG CCC GAG CTC CAG GCG ATG 692 

Glu Val Ser Asp Thr Arg Tyr val Thr Lys Pro Glu Leu Gin Ala Met 
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15 



20 



40 



45 



50 



55 



185 190 195 

TTT GAG GAC GAG TCT AAC TCA TTT ACC CCT TGG TTC AAG TTG ATT GCC 74 0 

Phe Glu Asp Glu Ser Asn Ser Phe Thr Pro Trp Phe Lys Leu lie Ala 

200 205 210 

CGA GAC TTC CTG TTT GGC TGG TGG GAT CAA CTT CTC GCC AGA CGA AAT 7 88 

Arg Asp Phe Leu Phe Gly Trp Trp Asp Gin Leu Leu Ala Arg Arg Asn 
215 220 225 230 

GAA AAG GGT GAG GTC GAT GCC AAA TCG TTG GAG GAT CTC TCG GAC AAC 8 36 

Glu Lys Gly Glu Val Asp Ala Lys Ser Leu Glu Asp Leu Ser Asp Asn 

235 240 245 

AAA GTC TGG AAG ATG TAGTCGACC CTTCTTTCTG TACAGTCATC TCAGTTCGCC 8 90 

Lys Val Trp Lys Met *** 
250 

TGTTGGTTGC TTGCTTCTTG CTCTTCTTTC TATATATCTT TTTTCTTGCC TGGGTAGACT 950 



TGATCTTTCT ACATAGCATA CGCATACATA CATAAACTCT ATTTCTTGTT CTTTATCTCT 1010 
25 CTTCTAAGGG AATCTTCAAG ATCAATTTCT TTTTGGGCTA CAACATTTCA GATCAATGTT 1070 

GCTTTTCAGA CTACAAAAAA AAAAAAAAA 10 99 

30 



35 SEQUENCE ID No. : 5 

LENGTH: 1074 

SEQUENCE TYPE: nucleic acid 
STRANDNESS: double 
TOPOLOGY: linear 
MOLECULAR TYPE: CDNA 
ORIGIN 

ORGANISM: Haematococcus pluvialis 
STRAIN: NIES-144 
SEQUENCE CHARACTERISTIC 



17 

BNSOOClD: <EP 0769S51 A1 _!_> 



EP 0 769 551 A1 



20 



25 



45 



50 



55 



CHARACTERISTIC CODE: CDS 
LOCATIION: 14 5.. 921 
DETERMINATION METHOD: E 
SEQUENCE: 

ATCGCTACTT GGAACCTGGC CCGGCGGCAG TCCGATGACG CGATGCTTCG TTCGTTGCTC 60 

AGAGGCCTCA CGCATTTCCC CCGCGTGAAC TCCGCGCAGC AGCCCAGCTG TGCACACGCG 12 0 

CGACTCCAGT TTAGGCCCAG AAGC ATG CAG CTG CTT GCC GAG GAC CGC ACA GAC 17 9 

Met Gin Leu Leu Ala Glu Asp Arg Thr Asp 
5 D 

CAT ATG AGG GGT GCA AGT ACC TGG GCA GGC GGG CAG TCG CAG GAT GAG 222 

His Met Arg Gly Ala Ser Thr Trp Ala Gly Gly Gin Ser Gin Asp Glu 

15 20 25 

CTG ATG CTG AAG GAC GAG TGC ATC TTG GTG GAT GCT GAC GAC AAC ATT 270 

Leu Met Leu Lys Asp Glu Cys lie Leu Val Asp Ala Asp Asp Asn lie 

30 35 40 

ACA GGC CAT GTC AGC AAG CTG GAG TGC CAC AAG TTC CTA CCA CAT CAG 318 

Thr Gly His Val Ser Lys Leu Glu Cys His Lys Phe Leu Pro His Gin 

45 50 55 

CCT GCA GGC CTG CTG CAC CGG GCC TTC TCT GTA TTC CTG TTT GAC GAC 366 

Pro Ala Gly Leu Leu His Arg Ala Phe Ser Val Phe Leu Phe Asp Asp 

60 65 70 

CAG GGG CGA CTG CTG CTG CAA CAG CGT GCA CGA TCA AAA ATC ACA TTC 414 

Gin Gly Arg Leu Leu Leu Gin Gin Arg Ala Arg Ser Lys lie Thr Phe 

75 80 85 90 

CCC AGT GTG TGG ACC AAC ACC TGC TGC AGC CAC CCT CTA CAT GGG CAG 4 62 

Pro Ser Val Trp Thr Asn Thr Cys Cys Ser His Pro Leu His Gly Gin 

95 100 105 

ACC CCA GAT GAG GTG GAC CAA CTA AGC CAG GTG GCC GAC GGC ACA GTA 510 

Thr Pro Asp Glu Val Asp Gin Leu Ser Gin Val Ala Asp Gly Thr Val 

110 115 120 

CCT GGC GCA AAG GCT GCT GCC ATC CGC AAG TTG GAG CAC GAG CTG GGG 558 

Pro Gly Ala Lys Ala Ala Ala lie Arg Lys Leu Glu His Glu Leu Gly 

125 130 135 

ATA CCA GCG CAC CAG CTG CCG GCC AGC GCG TTT CGC TTC CTC ACG CGT 606 

lie Pro Ala His Gin Leu Pro Ala Ser Ala Phe Arg Phe Leu Thr Arg 

140 145 150 
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TTG CAC TAC TGC GCC GCG GAC GTG CAG CCG GOT GCG ACA CAA 'iCA GCA 654 

Leu His Tyr Cys Ala Ala Asp val Gin Pro Ala Ala Thr Gin Ser Ala 
5 155 160 165 170 

CTC TGG GGC GAG CAC GAA ATG GAC TAC ATC TTA TTC ATC CGG GCC AAC 7 02 

Leu Trp Gly Glu His Glu Met Asp Tyr lie Leu Phe lie Arg Ala Asn 

175 180 185 

'0 GTC ACC CTT GCG CCC AAC CCT GAC GAG GTG GAC GAA GTC AGG TAC GTG 750 

Val Thr Leu Ala Pro Asn Pro Asp Glu Val Asp Glu Val Arg Tyr Val 

190 195 200 

ACG CAG GAG GAG CTG CGG CAG ATG ATG CAG CCG GAC AAT GGG TTG CAA 7 98 

Thr Gin Glu Glu Leu Arg Gin Met Met Gin Pro Asp Asn Gly Leu Gin 

205 210 215 

TGG TCG CCG TGG TTT CGC ATC ATC GCC GCG CGC TTC CTT GAG CGC TGG 84 6 

Trp Ser Pro Trp Phe Arg He He Ala Ala Arg Phe Leu Glu Arg Trp 

220 225 230 

TGG GCT GAC CTA GAC GCG GCC CTG AAC ACT GAC AAA CAC GAG GAT TGG 8 94 

Trp Ala Asp Leu Asp Ala Ala Leu Asn Thr Asp Lys His Glu Asp Trp 
235 240 245 250 

GGA ACG GTG CAT CAC ATC AAC GAA GCG TGA AAACAG AAGCTGTAGG 940 

Gly Thr Val His His He Asn Glu Ala *** 
255 

30 ATGTCAAGAC ACGTCATGAG GGGGCTTGGC ATCTTGGCGG CTTCGTATCT CTTTTTACTG 1000 

AGACTGAACC TGCAGCTGGA GACAATGGTG AGCCCAATTC AACTTTCCGC TGCACTGGAA 1060 
35 AAAAAAAAAA AAAA 1074 



20 



25 



40 



45 



50 



SEQUENCE ID No. : 6 
LENGTH: 1058 

SEQUENCE TYPE: nucleic acid 
STRANDNESS: double 
TOPOLOGY: linear 
MOLECULAR TYPE: genomic DNA 
ORIGIN 



55 
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20 



25 



30 



35 



50 



ORGANISM: Saccharomyces cerevisiae 

STRAIN: S288C 

SEQUENCE CHARACTERISTIC 

CHARACTERISTIC CODE: CDS 

LOCATIION: 187.. 1050 

DETERMINATION METHOD: S 
SEQUENCE: 

TCGATGGGGG TTGCCTTTCT TTTTCGGTCT TAACTCCATT TATATTTATT TATTCATTTT 60 
TATCTATTTA ACAGGAAACA GTTTTCTAGT GACAAGAAGG CGTATATCCC ACTTAATTCA 120 
ATATTAGAGT ATTCGTATTT GGAATACAGG AAGAGTAAAA ATAAGCCAAA AATTCATTAC 180 



ACCTCA ATG ACT GCC GAC AAC AAT AGT ATG CCC CAT GGT GCA GTA TCT AGT 231 
Met Thr Ala Asp Asn Asn Ser Met Pro His Gly Ala Val Ser Ser 
5 10 15 

TAC GCC AAA TTA GTG CAA AAC CAA ACA CCT GAA GAC ATT TTG GAA GAG 27 9 
Tyr Ala Lys Leu Val Gin Asn Gin Thr Pro Glu Asp lie Leu Glu Glu 

20 25 30 

TTT CCT GAA ATT ATT CCA TTA CAA CAA AGA CCT AAT ACC CGA TCT AGT 327 
Phe Pro Glu lie He Pro Leu Gin Gin Arg Pro Asn Thr Arg Ser Ser 

35 40 45 

GAG ACG TCA AAT GAC GAA AGC GGA GAA ACA TGT TTT TCT GGT CAT GAT 375 
Glu Thr Ser Asn Asp Glu Ser Gly Glu Thr Cys Phe Ser Gly His Asp 

50 55 60 

GAG GAG CAA ATT AAG TTA ATG AAT GAA AAT TGT ATT GTT TTG GAT TGG 423 
Glu Glu Gin He Lys Leu Met Asn Glu Asn Cys He Val Leu Asp Trp 
40 65 70 75 

GAC GAT AAT GCT ATT GGT GCC GGT ACC AAG AAA GTT TGT CAT TTA ATG 471 
Asp Asp Asn Ala He Gly Ala Gly Thr Lys Lys Val Cys His Leu Met 
80 85 90 95 

45 GAA AAT ATT GAA AAG GGT TTA CTA CAT CGT GCA TTC TCC GTC TTT ATT 519 

Glu Asn He Glu Lys Gly Leu Leu His Arg Ala Phe Ser Val Phe He 

100 105 110 

TTC AAT GAA CAA GGT GAA TTA CTT TTA CAA CAA AGA GCC ACT GAA AAA 567 
Phe Asn Glu Gin Gly Glu Leu Leu Leu Gin Gin Arg Ala Thr Glu Lys 
115 120 125 



55 
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ATA ACT TTC CCT GAT CTT TGG ACT AAC ACA TGC TGC TOT CAT CCA CTA 615 

He Thr Phe Pro Asp Leu Trp Thr Asn Thr Cys Cys Ser His Pro Leu 
5 130 135 140 

TGT ATT GAT GAC GAA TTA GGT TTG AAG GGT AAG CTA GAC GAT AAG ATT 663 

Cys He Asp Asp Glu Leu Gly Leu Lys Gly Lys Leu Asp Asp Lys He 
145 150 155 

to AAG GGC GCT ATT ACT GCG GCG GTG AGA AAA CTA GAT CAT GAA TTA GGT 711 

Lys Gly Ala He Thr Ala Ala Val Arg Lys Leu Asp His Glu Leu Gly 

160 165 170 175 

ATT CCA GAA GAT GAA ACT AAG ACA AGG GGT AAG TTT CAC TTT TTA AAC 759 

He Pro Glu Asp Glu Thr Lys Thr Arg Gly Lys Phe His Phe Leu Asn 

180 185 190 

AGA ATC CAT TAC ATG GCA CCA AGC AAT GAA CCA TGG GGT GAA CAT GAA 807 

Arg He His Tyr Met Ala Pro Ser Asn Glu Pro Trp Gly Glu His Glu 

195 200 205 

ATT GAT TAC ATC CTA TTT TAT AAG ATC AAC GCT AAA GAA AAC TTG ACT 855 

He Asp Tyr He Leu Phe Tyr Lys lie Asn Ala Lys Glu Asn Leu Thr 

210 215 220 

GTC AAC CCA AAC GTC AAT GAA GTT AGA GAC TTC AAA TGG GTT TCA CCA 903 

Val Asn Pro Asn Val Asn Glu Val Arg Asp Phe Lys Trp Val Ser Pro 

225 230 235 

AAT GAT TTG AAA ACT ATG TTT GCT GAC CCA AGT TAC AAG TTT ACG CCT 951 

Asn Asp Leu Lys Thr Met Phe Ala Asp Pro Ser Tyr Lys Phe Thr Pro 

240 245 250 255 

TGG TTT AAG ATT ATT TGC GAG AAT TAC TTA TTC AAC TGG TGG GAG CAA 999 

Trp Phe Lys He He Cys Glu Asn Tyr Leu Phe Asn Trp Trp Glu Gin 

260 265 270 

TA GAT GAC CTT TCT GAA GTG GAA AAT GAC AGG CAA ATT CAT AGA ATG 104 7 

Leu Asp Asp Leu Ser Glu Val Glu Asn Asp Arg Gin He His Arg Met 
40 275 280 285 
CTA TAA CAACG 1058 
Leu *** 

45 



25 



30 



35 



Claims 

so 1. A DNA chain having characteristic of increasing carotenoid production, and containing the nucleotide sequence 
which encodes the polypeptide having the amino acid sequence substantially described in SEQUENCE ID No. 1 , 
or a DNA chain which hybridizes with said DNA chain. 

2. A DNA chain having characteristic of increasing carotenoid production, and containing the nucleotide sequence 
55 which encodes the polypeptide having the amino aid sequence substantially described in SEQUENCE ID No. 2, or 

a DNA chain which hybridizes with said DNA chain. 

3. A method for producing carotenoid characterized by introducing DNA chain described in one of claim 1 or 2 into 
carotenoid -producing microorganisms, culturing said transformed microorganism and obtaining higher carotenoid 
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content in the culture broth and cells . 

4. A method for producing carotenotd characterized by introducing DNA chain containing the nucleotide sequence 
which encodes the polypeptide having the amino acid sequence substantially described in SEQUENCE ID No. 3, 
5 or DNA chain which hybridizes with said DNA chain introducing to carotenoid-producing microorganism, culturing 

said transformed microorganism and obtainig higher carotenotd content in the culture broth and cells . 
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FIG. 2 




lycopene 
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FIG. 4 



A 



ATG 
Met 



77A 

Leu 



GTT 
Val 



ATG 
Met 



TCC 
Ser 



GAA 
Glu 



AAC 
Asn 



TCC 
Ser 



CTC 
Leu 



ATT 
lie 



AAG 



AAG 



ACC 
Thr 



CAC 
His 



CCT 
Pro 



TTC 
Phe 



ACG 
Thr 



GGC 
Gly 



TTG 
Leu 



TAC 
Tyr 



GAG 
Glu 



ATG CCC 
Met Pro 

63 

GAG TAC 
Glu Tyr 

117 

CCG GAC 
Pro Asp 

171 

AAC ATC 
Aan He 

225 

CGC CCA 
Arg Pro 

279 

TTC CCT 
Phe Pro 

333 

GAG GTT 
Glu Val 

387 

GAG CAC 
Glu His 

441 

CTC ACT 
Leu Thr 

495 

ATC GAC 
lie Asp 



AAC 
Asn 



GAT 
Asp 



GAT 
Asp 



AAC 
Asn 



TCG 
Ser 



GGA 
Gly 



GAA 
Glu 



GAG 
Glu 



AGG 
Arg 

TAC 
Tyr 



18 

ATT GTT 
He Val 

72 

GAG GAG 
Glu Glu 

126 

GTG GCC 
Val Ala 

180 

GCG CCC 
Ala Pro 

234 

GAC GGA 
Asp Gly 

288 

ATG TGG 
Met Trp 

342 

GAG GAG 
Glu Glu 

396 

CTT GGC 
Leu Gly 

450 

ATA CAT 
He His 

504 

ATT CTC 
lie Leu 



27 

CCC CCC 
Pro Pro 

81 

CAG GTC 
Gin Val 

135 
TAT GGA 
Tyr Gly 

189 
AAG GAC 
Lys Asp 

243 
GCA CTC 
Ala Leu 

297 
ACC AAC 
Thr Asn 

351 
AAC CAG 
Asn Gin 

405 
GTG CCT 
Val Pro 

4S9 
TAC CTC 
Tyr Leu 

513 
TTC TCA 
Phe Ser 



GCC 
Ala 



AGG 
Arg 



GAG 
Glu 



CTC 
Leu 



CTG 
Leu 



ACG 
Thr 



ATC 
lie 



ACA 
Thr 



GCT 
Ala 



ACC 
Thr 



36 

GAG GTC 
Glu Val 

90 

CTG ATG 
Leu Met 

144 
GCT TCG 
Ala Ser 

198 
CTC CAC 
Leu His 

252 
CTT CAG 
Leu Gin 

306 
TGT TGC 
Cys Cys 

360 
GGT GTT 
Gly Val 

414 

TCG TCG 
Ser Ser 

468 
CCG AGT 
Pro Ser 

522 
ACA CCT 
Thr Pro 



CGA 
Arg 



GAG 
Glu 



AAA 
Lys 

CGA 
Arg 



CGA 
Arg 



AGT 
Ser 



CGA 
Arg 



ACT 
Thr 



GAC 
Asp 



45 

ACC GAA 
Thr Glu 

99 

GAG CGA 
Glu Arg 

153 
AAG ACC 
Lys Thr 

207 
GCA TTC 
Ala Phe 

261 
AGA GCG 
Arg Ala 

315 
CAT CCT 
His Pro 

369 
CGA GCT 
Arg Ala 

423 
CCG CCC 
Pro Pro 

477 
GGA CTC 
Gly Leu 



GGA 

Gly 



TGT 
Cys 



TGC 
Cys 
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GAC 
Asp 



TTG 
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Asp 



TGG 
Trp 
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Thr 



AAC 
Asn 



S49 

GAA GTC 
Glu Val 



TCT 
Ser 



558 

GAC ACT 
Asp Thr 



567 
CGA TAT 
Arg Tyr 



GTC 
Val 



576 
ACC AAG 
Thr Lys 



CCC 
Pro 



531 
GAA CAC 
Glu His 

585 
GAG CTC 
Glu Leu 



ACT 
Thr 



CAG 
Gin 



54 

CTC AGT 
Leu Ser 
16 
103 
ATT CTT 
He Leu 
36 
162 
CAC TTG 
His Leu 
54 
216 
GTG TTT 
Val Phe 
72 
270 
GAG AAG 
Glu Lys 
90 
324 
ATC 
Hi 
103 
378 
TCC CGA 
Ser Arg 
12 6 
432 
TCG TTC 
Ser Phe 
144 
486 
GGA GAA 
Gly Glu 
162 
540 
GGA AAC 
Gly Asn 
180 
594 
GCG ATG 
Ala Met 
198 



AGC 
Ser 
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FIG. 5 



603 612 621 630 639 648 

TTT GAG GAC GAG 7CT AAC TCA TTT ACC CCT TGG TTC AAG TTG ATT GCC CGA GAC 
Phe Glu Asp Glu Ser Asn Ser Phe Thr Pro Trp Phe Lys Leu lie Ala Arg Asp 

216 

657 666 675 684 693 702 

TTC CTG TTT GGC TGG TGG GAT CAA CTT CTC GCC AGA CGA AAT GAA AAG GGT GAG 
Phe Leu Phe Gly Trp Trp Asp Gin Leu Leu Ala Arg Arg Asn Glu Lys Gly Glu 

234 

711 720 729 738 747 756 

GTC GAT GCC AAA TCG TTG GAG GAT CTC TCG GAC AAC AAA GTC TGG AAG ATG TAG 
Val Asp Ala Lys Ser Leu Glu Asp Leu Ser Asp Asn Lys Val Trp Lys Meti*** 

2Slf 

B 
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FIG. 6 



c 

I 9 18 27 36 45 54 

'ATG CAG CTG CTT GCC GAG GAC CGC AC A GAC CAT ATG AGG GGT GCA AGT ACC TGG 
Met Gin Leu Leu Ala Glu Asp Arg Thr Asp His Met Arg Gly Ala Ser Thr Trp 

18 

63 72 81 90 99 108 

GCA GGC GGG CAG TCG CAG GAT GAG CTG ATG CTG AAG GAC GAG TGC ATC TTG GTG 
Ala Gly Gly Gin Ser Gin Asp Glu Leu Met Leu Lys Asp Glu Cys He Leu Val 

36 

117 126 135 144 153 162 

GAT GCT GAC GAC AAC ATT AC A GGC CAT GTC AGC AAG CTG GAG TGC CAC AAG TTC 
Asp Ala Asp Asp Asn He Thr Gly His Val Ser Lys Leu Glu Cys His Lys Phe 

54 

171 180 189 198 207 216 

CTA CCA CAT CAG CCT GCA GGC CTG CTG CAC CGG GCC TTC TCT GTA TTC CTG TTT 
Leu Pro His Gin Pro Ala Gly Leu Leu His Arg Ala Phe Ser Val Phe Leu Phe 

72 

225 234 243 252 261 270 

GAC GAC CAG GGG CGA CTG CTG CTG CAA CAG CGT GCA CGA TCA AAA ATC AC A TTC 
Asp Asp Gin Gly Arg Leu Leu Leu Gin Gin Arg Ala Arg Ser Lys He Thr Phe 

90 

279 288 297 306 315 324 

CCC AGT GTG TGG ACC AAC ACC TGC TGC AGC CAC CCT CTA CAT GGG CAG ACC CCA 
Pro Ser Val Trp Thr Asn Thr Cys Cys Ser His Pro Leu His Gly Gin Thr Pro 

108 

333 342 351 360 369 378 

GAT GAG GTG GAC CAA CTA AGC CAG GTG GCC GAC GGC ACA GTA CCT GGC GCA AAG 
Asp Glu Val Asp Gin Leu Ser Gin Val Ala Asp Gly Thr Val Pro Gly Ala Lys 

126 

387 396 405 414 423 432 

GCT GCT GCC ATC CGC AAG TTG GAG CAC GAG CTG GGG ATA CCA GCG CAC CAG CTG 
Ala Ala Ala He Arg Lys Leu Glu His Glu Leu Gly He Pro Ala His Gin Leu 

144 

441 450 459 468 477 486 

CCG GCC AGC GCG TTT CGC TTC CTC ACG CGT TTG CAC TAC TGC GCC GCG GAC GTG 
Pro Ala Ser Ala Phe Arg Phe Leu Thr Arg Leu His Tyr Cys Ala Ala Asp Val 

162 

495 504 513 522 531 540 

CAG CCG GCT GCG ACA CAA TCA GCA CTC TGG GGC GAG CAC GAA ATG GAC TAC ATC 
Gin Pro Ala Ala Thr Gin Ser Ala Leu Trp Gly Glu His Glu Met Asp Tyr He 

130 

549 558 567 576 585 594 

TTA TTC ATC CGG GCC AAC GTC ACC CTT GCG CCC AAC CCT GAC GAG GTG GAC GAA 
Leu Phe He Arg Ala Asn Val Thr Leu Ala Pro Asn Pro Asp Glu Val Asp Glu 

198 
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G.7 



603 612 621 630 639 648 

GTC AGG TAC GTG ACG CAG GAG GAG CTG CGG CAG ATG ATG CAG CCG GAC AAT GGG 
Val Arg Tyr Val Thr Gin Glu Glu Leu Arg Gin Met Met Gin Pro Asp Asn Gly 

216 

657 666 675 634 693 702 

TTG CAA TGG TCG CCG TGG TTT CGC ATC ATC GCC GCG CGC TTC CTT GAG CGC TGG 
Leu Gin Trp Ser Pro Trp Phe Arg lie He Ala Ala Arg Phe Leu Glu Arg Trp 

234 

71I 720 729 738 747 756 

TGG GCT GAC CTA GAC GCG GCC CTG AAC ACT GAC AAA CAC GAG GAT TGG GGA ACG 
Trp Ala Aap Leu Aap Ala Ala Leu Aan Thr Aap Ly3 His Glu Asp Trp Gly Thr 

252 

765 774 780 

GTG CAT CAC ATC AAC GAA GCG TGA 
Val His His lie Aan Glu Alai*** 

259T 

D 
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FIG. 8 



E 

I 9 18 27 36 45 54 

*ATG ACT GCC GAC AAC AAT AGT ATG CCC CAT GGT GCA GTA TC7 AGT TAC GCC AAA 
Met Thr Ala Asp Asn Asn Ser Met Pro His Gly Ala Val Ser Ser Tyr Ala Lys 

18 

63 72 81 90 99 108 

TTA GTG CAA AAC CAA AC A CCT GAA GAC ATT TTG GAA GAG TTT CCT GAA ATT ATT 
Leu Val Gin Asn Gin Thr Pro Glu Asp He Leu Glu Glu Phe Pro Glu He He 

36 

117 126 135 144 153 162 

CCA TTA CAA CAA AGA CCT AAT ACC CGA TCT AGT GAG ACG TCA AAT GAC GAA AGC 
Pro Leu Gin Gin Arg Pro Asn Thr Arg Ser Ser Glu Thr Ser Asn Asp Glu Ser 

54 

171 180 189 198 207 216 

GGA GAA ACA TGT TTT TCT GGT CAT GAT GAG GAG CAA ATT AAG TTA ATG AAT GAA 
Gly Glu Thr Cys Phe Ser Gly His Asp Glu Glu Gin He Lys Leu Met Ajn Glu 

72 

225 234 243 252 261 270 

AAT TGT ATT GTT TTG GAT TGG GAC GAT AAT GCT ATT GGT GCC GGT ACC AAG AAA 
Asn Cys He Val Leu Asp Trp A3? Asp Asn Ala He Gly Ala Gly Thr Lys Lys 

90 

279 2B8 297 306 315 324 

GTT TGT CAT TTA ATG GAA AAT ATT GAA AAG GGT TTA CTA CAT CGT GCA TTC TCC 
Val Cys His Leu Met Glu Asn He Glu Lys Gly Leu Leu His Arg Ala Phe Ser 

108 

333 342 351 360 369 378 

GTC TTT ATT TTC AAT GAA CAA GGT GAA TTA CTT TTA CAA CAA AGA GCC ACT GAA 
Val Phe He Phe Asn Glu Gin Gly Glu Leu Leu Leu Gin Gin Arg Ala Thr Glu 

126 

387 396 405 414 423 432 

AAA ATA ACT TTC CCT GAT CTT TGG ACT AAC ACA TGC TGC TCT CAT CCA CTA TGT 
Lys He Thr Phe Pro Asp Leu Trp Thr Asn Thr Cys Cys Ser His Pro Leu Cys 

144 

441 450 459 468 477 486 

ATT GAT GAC GAA TTA GGT TTG AAG GGT AAG CTA GAC GAT AAG ATT AAG GGC GCT 
He Asp Asp Glu Leu Gly Leu Lys Gly Lys Leu Asp Asp Lys He Lys Gly Ala 

162 

495 504 513 522 531 540 

ATT ACT GCG GCG GTG AGA AAA CTA GAT CAT GAA TTA GGT ATT CCA GAA GAT GAA 
He Thr Ala Ala Val Arg Lys Leu Asp His Glu Leu Gly He Pro Glu Asp Glu 

180 
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G.9 



549 558 567 576 585 594 

ACT AAG AC A. AGG GGT AAG TTT CAC TTT TTA AAC AG A ATC CAT TAG ATG GCA CCA 
Thr Lya Thr Arg Gly Lya Phe Kia Phe Leu Aan Arg lie Hia Tyr Met Ala Pro 

198 

603 612 621 630 639 648 

AGO AAT GAA CCA TGG GOT GAA CAT GAA ATT GAT TAC ATC CTA TTT TAT AAG ATC 
Ser Aan Glu Pro Trp Gly Glu Hia Glu He Aap Tyr He Leu Phe Tyr Lya He 

216 

657 666 675 684 693 702 

AAC GCT AAA GAA AAC TTG ACT GTC AAC CCA AAC GTC AAT GAA GTT AG A GAC TTC 
Aan Ala Lya Glu Asa Leu Thr Val Aan Pro Aan Val Aan Glu Val Arg Aap Phe 

234 

711 720 729 738 747 756 

AAA TGG GTT TCA CCA AAT GAT TTG AAA ACT ATG TTT GCT GAC CCA AGT TAC AAG 
Lya Trp Val Ser Pro Aan Aap Leu Lya Thr Met Phe Ala Aap Pro Ser Tyr Lya 

252 

765 774 783 792 801 810 

TTT ACG CCT TGG TTT AAG ATT ATT TGC GAG AAT TAC TTA TTC AAC TGG TGG GAG 
Phe Thr Pro Trp Phe Lya He He Cya Glu Asa Tyr Leu Phe Aan Trp Trp Glu 

270 

819 828 837 846 855 864 

CAA TTA GAT GAC CTT TCT GAA GTG GAA AAT GAC AGO CAA ATT CAT AG A ATG CTA 
Gin Leu Aao Aap Leu Ser Glu Val Glu Aan Aap Arg Gin He Hia Arg Met Leu 

288J 

867 



31 



EP 0 769 551 A1 




32 

BNSOOCIO- <EP 0769551 Al _l_> 



EP 0 769 551 A1 



FIG. 11 
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FIG. 12 
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FIG. 13 
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FIG. 14 
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